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incorporated between a plurality of execution unite such as a memory access unit 
and an integer operation unit. The data on the bypaths is directly transferred to 
target units according to a control signal generated by a bypath processing 
control circuit . 
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A second decoder (114) of an instruction decode unit (119) decodes an operation 
code for a multiply-add operation, and a second operation unit (117) receives two 
data stored in a register file (115) to perform the multiply-add operation. In 
parallel with the operations of the second decoder (114) and the second operation 
unit (117), a first decoder (113) of the instruction decode unit (119) decodes an 
operation code for 2 data load, and an operand access unit (104) causes two data 
(e.g., n bits each) stored in an internal data memory (105). to be transferred in 
parallel in the form of combined 2n-bit data to a first operation unit (116) . 
Then, two predetermined registers of the register file (115) store the respective 
n-bit data from the first operation unit (116) . 
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Entered in the architecture information table 21 is architecture information such 
as the number of available registers, instruction sets, instruction latency- 
indicating the instruction delay, etc., for the computer in which a program to be 
compiled is executed. In the example shown in FIG. 3, the number of registers is 
32 for both general -purpose registers and floating point registers. Set in the 
instruction set column is information for use in converting instructions into 
standardized internal instructions, for example, an add instruction into an 
internal operation code INST . sub . 13 ADD, an addl instruction into an internal 
operation code INST. sub. 13 ADD. sub.-- L, etc. Entered in the instruction latency 
column is instruction delay in the four cases where two instructions are not 
dependent on each other; two instructions are definition-reference related; two 
instructions are reference-definition related; and two instructions are 
definition-definition related. In this example, the delay of an add instruction 
and an addl instruction is represented by 1, and the delay of a dependent fadd 
instruction (floating point add instruction) is represented by 4. Other 
information about the number and types of operating units, etc. is entered in the 
architecture information table 21. 
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A compiling apparatus and method in which instructions are scheduled for an 
efficient parallel process with a register allotting process and an instruction 
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and renames registers by replacing the register numbers used by the instructions 
with other register numbers according to the collected register information and 
the analysis of definition/reference instruction dependency. The instructions are 
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very long instruction word multi- threaded microprocessor architecture. In the 
static interleaving technique, each function unit in the processor is allocated 
for the execution of an instruction from a particular thread in a fixed 
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the instruction threads. The strategy of resource allocation is exposed to a 
parallel compiler which organizes a sequence of instructions into the horizontal 
instruction words which form each thread so as to maintain the data dependencies 
among the instructions and to take into account the fixed predetermined allocation 
of hardware resources. 
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ART-UNIT: 2 73 

PRIMARY -EXAMINER: Treat; William M. 
ATTY- AGENT -FIRM: McDermott, Will & Emery 



In a processor employing a delayed branch method, delayed branch control which 
does not complicate instruction execution sequence and improves the readability of 
a program on the assembler level is implemented without providing a control bit in 
an instruction code. The delayed branch control according to the present invention 
involves the use of a branch- information storing circuit for storing the 
occurrence or nonoccurrence of a branch in a specified one of a continuous 
sequence of cycles immediately before a current execute cycle which are equal in 
number to delay slots in the processor. In executing a delayed branch instruction, 
when the branch- information storing circuit stores the occurrence of a branch in 
the specified cycle, a branch is disabled. This prevents instruction execution 
sequence from being complicated even when individual branch conditions for 
consecutive delayed branch instructions are satisfied, so that the program on the 
assembler level is improved in readability. The branch- information storing circuit 
can simply be composed of a combination of a shift register, a counter, and a 
latch. 

6 Claims, 17 Drawing figures 
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INT-CL (IPC) : G06F 1/02; G06F 7/60; G06F 9/34; G06F 12/00 
ABSTRACT : 

PURPOSE: To contrive the effective use of data supplied to a delay register by 
providing a delay register that holds input data and supplying the held data 
selectively to a data memory. 

CONSTITUTION: Data read out from a data memory DM are always supplied to a delay 
register DR regardless of the content of arithmetic processing. Supplied data are 
held until succeeding arithmetic process instruction executing cycle even when a 
succeeding instruction execution cycle is a transfer instruction execution cycle 
or jump instruction execution cycle. The held data are made writable in the memory 
DM in succeeding arithmetic process instruction execution cycle. Thus, data 
supplied to the register DR can be utilized effectively, and at the same time, the 
high speed processing of data and reduction of the number of program steps can be 
realized. 

COPYRIGHT: (C) 198 7 , JPO&Japio 
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ABSTRACTED- PUB* NO: EP 556825A 
BASIC -ABSTRACT: 

The microinstruction based processor includes a ROM (102) supplying 
microinstructions to a selector (104) with a repeat register ( 103 ) . A control 
circuit (107) for the partial instruction selector (106) . Instructions are fed from 
this selector to a decoder(109) partly through an instruction delay register ( 108 ) . 
The instruction is then executed in the operating section (110) . 

One field (212) from the microinstruction controls two data buses and a group of 
registers directly. The field from the delay register controls a third data bus 
and register, and the operating code is delayed by one cycle. 

ADVANTAGE - Reduces the number of cycles required to perform instructions and also 
reduces the number of registers required. 

DD 

ABSTRACTED -PUB -NO: EP 55682 5 A 
EQUIVALENT -ABSTRACTS : 

CHOSEN- DRAWING : Dwg.9/12 
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ABSTRACTED- PUB -NO: SU 886000B 
BASIC- ABSTRACT: 

The analyser is designed for use in both universal and specialised digital 
computers . 

A register (15) stores the address of the preceding instruction in the case of an 
interruption, whilst a counter (19) stores the address of the current instruction . 
If the latter causes disruption of the normal instruction sequence, its address is 
fed into a local memory (10) . A signal from a delay element (20) increases the 
contents of the counter by 1 and is also fed to a 2nd delay element (21) , which 
causes the contents of a program state register (6) to be fed into a buffer (15) 
and also energises a shift register (14) , which shifts the instruction length code 
from the program state register, a jump sign, and an interruption sign. 

When an instruction which caused an interruption is executed, the interrupt 
signals are fed by an OR element (16) to the jump sign register (13) , and the 
address of this instruction is stored in the buffer for the whole of the duration 
of the interruption. Bui. 44/30. 11 .81 

ABSTRACTED- PUB -NO: SU 886 0 00B 
EQUIVALENT- ABSTRACTS : 
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COPYRIGHT 2000 DERWENT INFORMATION LTD 

TITLE: Loop instruction data processor - uses counter, address register, program 
counter, clock, ROM, instruction register and loop counter 

ABEQ : 

A processor for treating data, comprising: a programme memory (1) for storing a 
plurality of instructions, including a programme loop control instruction which 
is defined by an intial address and a final address and which comprises a 
repetition field for defining the desired number of loop repeats, a programme 
counter (5) with a first step-wise control for applying the addresses to said 
programme memory, loop counting means (35) for counting the number of loop 
repeats after each passage of the initial address, a first register (30) for 
storing said initial address, first detection means for detecting the appearance 
of the final address in order to load the programme counter with the contents of 
said first register, second detection means which are coupled to said counting 
means and which operate as a function of the repetition field so as to inhibit 
said first detection means upon the appearance of the desired number of repeats 
in order to enable the programme counter to abandon said programme loop, 
characterised in that the control instruction also comprises a delay field for 
defining an addressing interval betv.'sen the control instruction and said initial 
address, and in that for treating said delay field the processor comprises: a 
delay counter (50) which is loaded as a function of the delay field with a second 
control for operation in synchronism with said programme counter during the delay 
interval, third detection means which are coupled to said delay counter so as to 
detect the end of the delay interval and to load said first register. (8pp) 
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DESIGNATED- STATES : DE FR GB IT SE DE FR GB IT SE 
CITED-DOCUMENTS : 2 . Jnl . Ref ; No- SR . Pub 
AP PLICATION -DATA : 

PUB-NO APPL-DATE APPL-NO APPL-DESCRI PTOR 

EP 150535A December 17, 1984 1984EP-0201883 N/A 

US 4792892A May 1, 1987 1 98 7US- 0 04 84 8 1 N/A 

INT-CL (IPC) : G06F 9/40 

ABSTRACTED- PUB -NO: EP 15 053 5A 
BASIC- ABSTRACT: 

Program instructions from a read-only memory (1) are passed to an instruction 
register (2) and control circuit (3) . Address codes are generated by a program 
counter (5) which is stepped by a clock (10) . A loop circuit (20) causes part of 
the program announced by a loop command instruction to be carried out by repeated 
execution of a sequence of instructions. 

An instruction counter (25) provides an end-of -sequence signal which makes an 
instruction address register (3 0) recall from the program counter the address of 
the first instruction of the sequence. A loop counter (35) counts the number of 
executions of the sequence, and signals the end of that part of the program. 

ADVANTAGE - Execution of looped sequence is not conditioned by absolute addresses 
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of instructions. 

ABSTRACTED- PUB -NO: EP 15053 5B 
EQUIVALENT-ABSTRACTS : 

A processor for treating data, comprising: a programme memory (1) for storing a 
plurality of instructions, including a programme loop control instruction which is 
defined by an intial address and a final address and which comprises a repetition 
field for defining the desired number of loop repeats, a programme counter (5) 
with a first step-wise control for applying the addresses to said programme 
memory, loop counting means (35) for counting the number of loop repeats after 
each passage of the initial address, a first register (30) for storing said 
initial address, first detection means for detecting the appearance of the final 
address in order to load the programme counter with the contents of said first 
register, second detection means which are coupled to said counting means and 
which operate as a function of the repetition field so as to inhibit said first 
detection means upon the appearance of the desired number of repeats in order to 
enable the programme counter to abandon said programme loop, characterised in that 
the control instruction also comprises a delay field for defining an addressing 
interval between the control instruction and said initial address, and in that for 
treating said delay field the processor comprises: a delay counter (50) which is 
loaded as a function of the delay field with a second control for operation in 
synchronism with said programme counter during the delay interval, third detection 
means which are coupled to said delay counter so as to detect the end of the delay 
interval and to load said first register. (8pp) 

US 4792892A 

To execute a loop control instruction, calling for repeated execution N times of a 
sequence of "i" instructions, the processor includes a loop circuit. The circuit 
comprises an instruction counter which counts execution of the instructions in the 
loop sequence and produces an end-of- sequence signal upon each completion of the 
loop . 

A register refreshes the program counter with the address of the first instruction 
in the loop in response to each end-of- sequence signal. 

A loop counter counts the number of completions of the loop and delivers a signal 
indicating the end of loop portion of the entire program and enables the program 
counter to continue with the rest the program. The delay in loop execution permits 
initialising of registers in the data processor to permit pipeline execution of 
the loop instruction. 

ADVANTAGE - Synchronisation problems are avoided. 
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1950 becomes enabled. 
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ATTY- AGENT -FIRM: Fish & Richardson P.C. 

ABSTRACT : 

A system and method reorder instructions for effecting faster branch execution. A 
processor element is coupled to receive stored instructions in a first order, and 
to process the received instructions in a different order, the processing 
occurring after each stored instruction of a first type is issued, and after a 
delay time, after each stored instruction of a second type is issued. The delay 
time is based on a delay value associated with the second type of instructions. In 
particular, the instructions include branch and non-branch instruction wherein 
firing time information identifies a time of execution of the branch instruction 
which is a variable number of instructions cycles prior to a time of execution of 
a last to be executed instruction in a basic block. Accordingly, branch 
instructions can be completely executed no later than during the processing of the 
last to be executed non-branch instruction in the basic block thereby speeding up 
overall processing of the software program by the system. 
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counting means have been described for use in a microprocessor, such as U.S. Ser. 
No. 07/048,841, filed May 1, 1987, now issued in U.S. Pat. No. -4,792,892. The 
reference relates to a microprocessor, wherein a program loop instruction has a 
delay field . Through this delay field, the actual start of the loop is delayed 
over a specified number of intervening instructions. After attaining the first 
loop instruction, both the number of instructions within a single execution of 
the loop and the number of successive executions of the loop are counted for so 
attaining a correct execution of the program. The present invention is not 
directed, however, to counting instructions. 
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ABSTRACT : 

A data processor for executing a program of instructions stored in a program 
memory controlled by a program counter. To execute a loop control instruction, 
calling for repeated execution N times of a sequence of "i" instructions, the 
processor includes a loop circuit having an instruction counter which counts 
execution of the instructions in the loop sequence and produces an end-of- sequence 
signal upon each completion of the loop, a register which refreshes the program 
counter with the address of the first instruction in the loop in response to each 
end-of -sequence signal, and a loop counter which counts the number of completions 
of the loop and delivers a signal indicating the end of the loop portion of the 
entire program and enabling the program counter to continue on with the rest of 
the program. The delay in loop execution permits initializing of registers in the 
data processor so as to permit pipeline execution of the loop instruction. 
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An apparatus and method for improving the execution of string instructions is 
provided. The apparatus includes a translator which repetitively generates a micro 
instruction sequence applicable to the particular string operation to be 
performed, and an execution unit for executing the micro instruction sequence. In 
addition, a counter is provided to hold a count value corresponding to the number 
of times the micro instruction sequence is to be executed, and is decremented each 
time the sequence is executed. The translator continues to generate the micro 
instruction sequence until receiving a signal from a counter which indicates that 
all of the string has been operated upon. In addition, the execution unit receives 
the signal from the counter and tests subsequent micro instructions to determine 
whether they are associated with string instructions. If so, the execution unit 
performs NOPs in place of those micro instructions. 

9 Claims, 6 Drawing figures 
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A system and method is provided for executing a low power no operation instruction 
in a data processor (10) with a minimal amount of power consumption. In the 
instruction, an opcode has a mnemonic form of "SLEEP" and an operand which 
specifies a number of timing cycles the instruction should be executed. During 
execution of the SLEEP instruction, the operand is provided to a general register 
(26) in a CPU (12) of the data processor (10) . An ALU (28) accesses the register 
(26) and decrements the operand until the contents are equal to zero. When the ALU 
(28) has decremented the operand to zero, a next software instruction is accessed 
and executed by the data processor. 
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